# High-throughput Inference

## QwQ-32B INT8 W8A8

- License: Apache-2.0
- Author: ospatch
- Tags: Large Language Model, Transformers, English
- Downloads: 590 · Likes: 4

INT8 quantized version of QwQ-32B, optimized for high-throughput inference by reducing the bit-width of both the weights and the activations to 8 bits (W8A8).
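W8A8 means that both the weights (W8) and the activations (A8) are represented as 8-bit integers and multiplied in integer arithmetic, with scales used to map the result back to floating point. The NumPy sketch below is a minimal illustration of that idea, assuming symmetric per-tensor quantization; the function names, shapes, and scale scheme are illustrative only, and real W8A8 serving stacks run the INT8 matmul in fused GPU kernels.

```python
# Minimal sketch of symmetric INT8 W8A8 quantization for one linear layer.
# Per-tensor scales and the toy shapes are assumptions for illustration.
import numpy as np

def quantize_int8(x: np.ndarray):
    """Symmetric per-tensor quantization to INT8; returns (int8 values, scale)."""
    scale = np.abs(x).max() / 127.0                      # largest magnitude maps to 127
    q = np.clip(np.round(x / scale), -128, 127).astype(np.int8)
    return q, scale

def w8a8_linear(x: np.ndarray, weight: np.ndarray) -> np.ndarray:
    """Linear layer with 8-bit weights (W8) and 8-bit activations (A8).

    The matmul accumulates in INT32; the result is dequantized back to float
    using the product of the two scales.
    """
    q_w, s_w = quantize_int8(weight)
    q_x, s_x = quantize_int8(x)
    acc = q_x.astype(np.int32) @ q_w.astype(np.int32).T  # INT8 x INT8 -> INT32
    return acc.astype(np.float32) * (s_x * s_w)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4, 64)).astype(np.float32)    # activations
    w = rng.standard_normal((128, 64)).astype(np.float32)  # linear-layer weights
    ref = x @ w.T                                           # FP32 reference
    out = w8a8_linear(x, w)                                 # W8A8 approximation
    print("max abs error:", np.abs(ref - out).max())
```

Because both operands are 8-bit, the matrix multiply can use integer tensor cores and the weights take a quarter of the memory of FP32, which is where the throughput gain comes from; the small error printed above is the cost of the reduced precision.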